Search results for "neural activations"

showing 1 items of 1 documents

Studying the evolution of neural activation patterns during training of feed-forward ReLU networks

2021

The ability of deep neural networks to form powerful emergent representations of complex statistical patterns in data is as remarkable as imperfectly understood. For deep ReLU networks, these are encoded in the mixed discrete–continuous structure of linear weight matrices and non-linear binary activations. Our article develops a new technique for instrumenting such networks to efficiently record activation statistics, such as information content (entropy) and similarity of patterns, in real-world training runs. We then study the evolution of activation patterns during training for networks of different architecture using different training and initialization strategies. As a result, we see …

MultidisciplinaryArtificial IntelligenceElectronic computers. Computer sciencefeed-forward networksQA75.5-76.95activation patterns004 Informatikneural activationsRELUactivation entropy004 Data processingOriginal Research
researchProduct